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A HIDDEN SIGNAL IN THE ULAM SEQUENCE 


STEFAN STEINERBERGER 


Abstract. The Ulam sequence is defined as ai = l,a 2 = 2 and an being the smallest integer 
that can be written as the sum of two distinct earlier elements in a unique way. This gives 

1, 2, 3,4, 6,8,11,13,16,18, 26, 28, 36,38, 47,... 

Ulam remarked that understanding the sequence, which has been described as ’quite erratic’, 
seems difficult and indeed nothing is known. We report the empirical discovery of a surprising 
global rigidity phenomenon: there seems to exist a real q ~ 2.5714474995 ... such that 

{ctan mod 27r : n G N} generates an absolutely continuous non-uniform measure 
supported on a subset of T. Indeed, for the first 10^ elements of Ulam’s sequence, 
cos (2.5714474995 On) < 0 for all ^ {2, 3,47, 69} . 

The same phenomenon arises for some other initial conditions ai, a 2 : the distribution functions 
look very different from each other and have curious shapes. A similar but more subtle phe¬ 
nomenon seems to arise in Lagarias’ variant of MacMahon’s ’primes of measurement’ sequence. 


1. Introduction 

Stanislaw Ulam introduced his sequence 


1,2,3,4,6,8,11,13,16,18,26,28,36,38,47,48,53,57,62,69,72,77,82,87,97. 


in a 1964 survey m on unsolved problems. The construction is given by oi = 1,02 = 2 and then 
iteratively choosing the smallest integer that can be written as the sum of two distinct earlier 
elements in a unique way as the next element. Ulam writes 

One can consider a rule for growth of patterns - in one dimension it would be 
merely a rule for obtaining successive integers. [...] In both cases simple questions 
that come to mind about the properties of a sequence of integers thus obtained 
are notoriously hard to answer. (Ulam, 1964) 

He asks in the Preface of [55] whether it is possible to determine the asymptotic density of the 
sequence (this problem is sometimes incorrectly attributed to Recaman |16jl. The Ulam sequence 
was first extensively computed in 1966 by Muller [HEO], who gave the first 20.000 terms and 
predicted density 0. The most extensive computation we had access to is due to Daniel Strottman 
[26] . who computed the first 11.172.261 elements (up to 150.999.995). This data set shows that 
some of Muller’s predictions are not correct (the density, for example, seems to be very stable and 
around 0.0739). Different initial values oi, 02 can give rise to more structured sequences [51151 ITS]: 
for some of them the sequence of consecutive differences a„+i — a„ is eventually periodic. It seems 
that this is not the case for Ulam’s sequence: Knuth [T8| remarks that 04953 — 04952 = 262 and 
018858 ~ 018857 = 315. The sequence ’does not appear to follow any recognizable pattern’ |4] and 
is ’quite erratic’ [53]. We describe the (accidental) discovery of some very surprising structure. 
While using Fourier series with Ulam numbers as frequencies, we noticed a persisting signal in the 
noise: indeed, dilating the sequence by a factor a ~ 2.5714474995 ... and considering the sequence 
(aOn mod 27r) gives rise to a very regular distribution function. One surprising implication is 

cos (2.5714474995 o„) < 0 for the first 10^ Ulam numbers except 2,3,47,69. 


The dilation factor a seems to be a universal constant (of which we were able to compute the first 
few digits); given the distribution function, there is nothing special about the cosine and many 
similar inequalities could be derived (we used the cosine because it is perhaps the simplest). 
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2. The Observation 

2.1. A Fourier series. If the Ulam sequence had positive density, then by a classical theorem 
of Roth [2d) it would contain many 3-arithmetic progressions. We were interested in whether the 
existence of progressions would have any impact on the structure of the sequence (because, if 
a • n -|- 6 is in the sequence for n = 1, 2, 3, then a or 2a is not). There is a well-known connection 
between randomness in sets and smallness of Fourier coefficients and this motivated us to look at 

N N 

fN{x) = Re ^ ^ cos {anx). 

n—1 n—1 

Clearly, /Ar(0) = N and ||/Ar||_L 2 ~ '/N and therefore we expected |/Ar(a;)| ~ VN for x outside of 
0. Much to our surprise, we discovered that this is not the case and that there is an a ~ 2.571... 
with that /at (a) ~ —0.8N. Such a behavior is, of course, an indicator of an embedded signal. 




2.2. The hidden signal. An explicit computation with N 
signal at 


a ~ 2.5714474995 ... 


10 ^ pinpoints the location of the 


and, by symmetry, at 2tt — a. This signal acts as a hidden shift in frequency space: we can remove 
that shift in frequency by considering 



aun I 


: 1 < n < 


instead. 



Figure 3. Distribution of Sn on [0,27r] for N = 10^. 

A priori it might be reasonable to suspect that the Ulam sequence is ‘pseudo-random’ (in the 
same way as Cramer’s model suggests ‘pseudo-randomness’ of the primes). There are some local 
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obstructions (for example both primes and the Ulam sequence do not contain a triple (n, n+2, n+4) 
for n > 3) but it would a priori be conceivable that many statistical quantities could be predicted 
by a random model. Our discovery clearly shows this to be false: the Ulam sequence has some 
extremely rigid underlying structure. We are hopeful that the rigidity of the phenomenon will 
provide a first way to get a more rigorous understanding of the Ulam sequence; however, we 
also believe that the phenomenon might be related to some interesting mechanism in additive 
combinatorics and of independent interest. 


2.3. a and the power spectrum. Gibbs [8] has used this phenomenon as a basis for an algorithm 
that allows for a faster computation of Ulam numbers. Jud McCranie informed me that he and 
Gibbs have independently computed a large number of Ulam numbers (more than 2.4 billion) and 
verified the existence of the phenomenon up to that order. McGranie gives the bounds 

2.57144749846 < a < 2.57144749850. 


We do not have any natural conjecture for a closed-form expression of a. The value a is not 
unique in the sense that there are other values x such that 

N 

COS (xUn) CxN seems to exhibit linear growth, 

n—1 

however, these other values can be regarded as the effect of an underlying symmetry. Sinan 
Giintiirk (personal communication) observed that these other values seem to be explicitly given 
by {ka mod 2tt : k G Z}. If there is weak convergence of the empirical distribution, i.e. if 


1 ^ 
n—1 


/ic on T, then 


1 ^ 
lim — 

AT—>-oo N ^^ 
n—1 


cos {aan) = / cos {x)d^o 

Jt 


Gomputationally, our strategy consists of looking for nonzero values of the integral (whose value 
is 0 in the generic case, where /Xq, is a uniform distribution). In this framework, the frequency 
a ~ 2.571... is the one for which the phenomenon is most pronounced, however, there are other 
peaks. It is easy to see that if we have convergence to an absolutely continuous measure 

^ y] f{x)dx, then for alH G Z ^ ^ X! ^ 

n=l n=l \ k=0 ' ^ / 

and if fj, = f{x)dx is unevenly distributed in the sense that 

0 ^ / cos {x)dfj. = / cos {x)f{x)dx, 

Jt Jt 

then the same relation can be expected for the new density function. There are functions 


1 1 f 

f — for which nonetheless - > f 
•'^271 

k=0 


2Trk\ 


1 


however, the set of such functions is very small and not stable under perturbations. Generically, 
if f{x)dx is not the uniform distribution, then neither is the new distribution. In the case of the 
Ulam sequence (a„) it is thus to be expected that the signals at frequency ia mod 27r generated out 
of the initial measure (Fig.3) located at frequency a are never uniformly distributed. Numerically, 
this holds and 

N 

cos {(.aan) ceN. 

n—1 


i 

0 

I 

2 

3 

4 

5 

6 

7 

8 

ce 

I 

-0.794 

0.288 

0.253 

-0.578 

0.580 

-0.344 

0.057 

0.118 


Table 1. Empirical approximations of ci (and C-i = ce). 
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Numerical computation suggests that indeed all peaks in the spectrum are described by the set 
{ka mod 2tt : k G Z} which suggests that there is indeed only one hidden signal (shown in Figure 
3) at frequency a ~ 2.571... while all other peaks are explained by the symmetry described above. 
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Figure 4. Distribution at frequency 4a. Figure 5. Distribution at frequency 5a. 


C 4 and C 5 are especially large compared to other values (though not as big as ci ~ 2.571...). 
This seems to be without deeper meaning: the distributions arising for those values are mainly 
supported in regions where the cosine is negative and positive, respectively. It is easy to see that 
under suitable assumptions on the initial measure we get that > 0 as £ —> 00. 

2.4. Other initial values. There has been quite some work on the behavior of Ulam-type se¬ 
quences using the same construction rule with other initial values. It has first been observed by 
Queneau [TS] that the initial values ( 01 , 02 ) = (2,5) give rise to a sequence where o„_|_i — o„ is 
eventually periodic. Finch Elisiil] proved that this is the case whenever only finitely many even 
numbers appear in the sequence and conjectured that this is the case for (oi, 02) = (2, n) whenever 
n > 5 is odd. Finch’s conjecture was proved by Schmerl & Spiegel [Mj. Subsequently, Cassaigne 
& Finch [ 1 ] proved that all sequences starting from ( 01 , 02 ) = (4, n) with n = 1 (mod 4) contain 
precisely three even integers and are thus eventually periodic. The sequence (01,02) = (2,3) as 
well as all sequences ( 01 , 02 ) = (1,r) with n S N do not exhibit such behavior and are being 
described ’erratic’ in the above literature. 



Figure 6. Distribution for initial values Figure 7. Distribution for the initial val- 

(oi, 02 ) = (1, 3) for N = 2.5 ■ 10®. ues ( 01 , 02 ) = (1,4) for N = 3.9 ■ 10®. 

The underlying mechanism appears to be independent of the initial values (as long as the sequence 
is ’chaotic’ in the sense of not having periodic consecutive differences). Using N = 2.7 • 10® and 
N = 3.9 ■ 10® elements, respectively, we found that the frequencies for the erratic sequences with 
( 01 , 02 ) = (1,3) and ( 01 , 02 ) = (1,4) are given by 

a(i, 3 ) ~ 2.83349751... and a(i, 4 ) - 0.506013502 ... 
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and removing that hidden frequency gives rise to the two distributions Fig. 5 and Fig. 6. Another 
example is given by initial conditions (2, 3) with N = 5.7 • 10® elements, 

0(2,3) -- 1.1650128748... 



Figure 8. Distribution for the initial values (2, 3) for N = 5.7 • 10®. 

2.5. Additional remarks. A classical theorem of Hermann Weyl [55] states that if is a 

sequence of distinct positive integers, then the set of a G K for which the sequence 

(aa„ mod 27r)^j^ is not uniformly distributed 

has measure 0. Conversely, given any absolutely continuous measure p, on T and a fixed a such 
that a/(27r) is irrational, it is not difiicult to construct a sequence of integers a„ such that 
(oon mod 27r)^]^ is distributed according to p, by using the uniform distribution of {ka)keN- 
Naturally, these sequences are non-deterministic and quite artificial. 

Question. Are there any other sequences (a„)„gN of integers appearing ’naturally’ 
with the property that there exists a real a > 0 such that 

(aa„ mod has an absolutely continuous non-uniform distribution? 

We expect this property to be exceedingly rare. 

3. Other examples of curious behavior 

The author, after discovering the Ulam sequence phenomenon, tried a very large number of dif¬ 
ferent sequences arising in combinatorics and number theory in the hope of uncovering a second 
example. We have not found any other sequence with that property. Generally, a natural case 
distinction was observed: 

(1) cos (oia;) -I- cos (a 2 x) -|- • • • -I- cos {a^x) appears random 

(2) cos {aix) cos {a 2 x) -|- • • • -f cos (onx) has peaks at 27r/fc for some k gN 

(3) cos (oia;) -I- cos {a 2 x) -|- • • • -I- cos {a^x) exhibits other behavior. 

The first type of behavior is generic and observed for most sequences. The second type of behavior 

merely indicates irregularities in the distribution of the sequence modulo k and is easily observed 
in many sequences. We present one known example (based on work of Reznick [22j . The third 
type seems more complicated and it is not clear what type of phenomena one could encounter. 
In trying a very large number of examples, we only found the following three examples and they 
appear to be very different from each other: 

• a fixed, stationary distribution (Ulam sequence) 

• a fixed distribution on top of the uniform distribution, which weakens as more elements 
are added (the zeroes of the Riemann ^-function on the critical line, explained by Landau 
m, Ford & Zaharescu i) 
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• different intensities of randomness (Lagarias’ modification of ’primes of measurement’) 

3.1. Example 1: Stern diatomic sequence (Dijkstra, Reznick). We start with an interest¬ 
ing example of a sequence having irregulaties in residue classes. The Stern diatomic sequence is 
defined by oq = 0, oi = 1 and 

{ a „/2 if n is even 

tt n-i + Ori+i if n is odd. 

2 2 

The sequence has a surprising number of combinatorial properties. Plotting the associated cosine 
sum easily identifies peaks (of decreasing height) at 27r/3, 27r/5,27r/7,... - it is easy to find the 
origin of these peaks: elements of the Stern sequence are equally likely to be = z (mod p) as 
long as i ^ 0 but they are slightly less likely to be divisible by p. The simplest possible case 
(2|a„ if and only if 3|n) was observed by Dijsktra [2] in 1976 (with very similar arguments being 
already contained in the original paper of Stern [25)1. The fact that the phenomenon persists mod 
p ^ 2 seems to have first been discovered and proven by Reznick [52] in 2008. This example is 
representative: typically, if there any peaks, they are caused by an irregular distribution mod p. 



3.2. Example 2: Zeroes of the Riemann (^—function (Landau, Ford-Zaharescu). This 
section does not contain any new results but describes known result in our framework. We now 
discuss an interesting example of a sequence that exhibits a fixed distribution that weakens as 
more elements are added: the sequence tn of imaginary parts of zeroes Q{l/2 + itn) = 0 of the 
Riemann zeta function on the critical line. The ^—function is defined via 


C(s) = — and we are interested in 

n>l 



t gR. 


The first few zeroes (t„) are roughly located at 

~ 14.13, 21.02, 25.01, 30.42, 32.93, 37.58,... 

It is well-known that 


#<{te [0,T] :C( i+^^) =0 


TlogT 

27r 


as T —)• oo. 


This means that the density of zeroes ti < t 2 < < ... on the critical line is increasing and their 

consecutive differences are shrinking like 1 / log T. The distribution of consecutive differences of 
properly renormalized roots has been intensively studied ever since Montgomery m discovered 
a connection to distributions arising in the Gaussian Unitary Ensemble (GUE). This is of great 
relevance in light of the Hilbert-Polya question whether the roots correspond to the eigenvalues 
of a self-adjoint linear operator. Gomputing the cosine sum using the zeroes as frequencies results 
in a most interesting picture (Fig. 11); the spikes are centered at logp™ for p prime and m S N. 
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Figure 11. Value of the cosine sum on the Figure 12. The distribution of 
interval 0.1 < x < tt — 0.1 using 10® roots. (log5)t„ mod 27r for the first 10® roots. 


This is a consequence of a result of Landau who proved [without assuming the Riemann 
hypothesis) 


N 

n—l 



tN logP 

2-77 y/log 
/ logN \ 

V %/log P™ / 


+ 0 


( 


logAf A 
Vlog p™ J 


if X = logp"^ 
otherwise. 


Note that the sum ranges over JV elements and the leading order term for x = logp™ is given by 
fw ^ TV/log V. In order for this to happen at least V/log TV out of the first TV zeroes of the zeta 
function have to align in a nontrivial way when considered as log mod 27r (see Table 1). 

Since N/ log V <C iV that alignment weakens as more zeroes are being added. The distribution at 
scale N/ log N was found by Ford & Zaharescu [5] . 


There is a simple heuristic for this clustering using Euler’s product formula 

„ 1 




— p~^ 

</(s) = 0 means that the infinite product converges to 0 (no factors vanishes), which requires the 
prime numbers to be suitably aligned. The relationship with the observed alignment is due to 


1 


1 _p-(5+*‘") 


< 1 if ^ < [(logp)t„ mod 27r] < 


a 

log 2 

log 3 

log 5 

log? 

log 8 

log 9 

log 11 

e 

V5 

# points 

53258 

54392 

55123 

55336 

51883 

52572 

55398 

49992 

50086 


Table 2. Number of elements in {aU mod 27r : 1 < i < 10®} that lie in [7r/2,37r/2]. 


3.3. Examples: Lagarias’variant of MacMahon’s primes of measurement. MacMahon’s 
’primes of measurement’, sometimes called segmented numbers, are given by the sequence 

1, 2,4, 5,8,10,14,15,16,21,22,25,26,28,... 

generated by excluding all sums of two or more consecutive earlier elements of the sequence (see 
Guy’s Unsolved Problems in Number Theory [101 Section E30]). George Andrews conjectures that 

nlogn 
” log log n 

The sequence is very different from the Ulam sequence but certainly similar in spirit. The cosine 
sum with these terms does appear to be random, however, we also remark that the publicly 
available data set [20] we used only contains about 7000 terms and larger compilations may 
provide more insight. Lagarias m proposes excluding only sums of two or three consecutive 


































earlier members (which certainly increases the similarity to the definition of the Ulam sequence), 
the arising sequence being 

1, 2,4, 5,8,10,12,14,15,16,19,20,21,24,25, 27, 28,... 

We used a publicly available list of the first 10.000 elements [U]. First of all we note a peak at tt 
corresponding to an uneven distribution in Z 2 : indeed, it seems that elements of the sequence are 
more likely to be even than odd (with ^ 54.86% of elements being even). 



Figure 13. Cosine sum on [0.03, tt — 0.03] for the first 5.000 and 10.000 elements. 

The behavior away from the peak is more interesting: it seems as if 

N 

cos (a„x) 

n—1 

but that the sum is experiencing smaller levels of fluctuations for small x and larger levels for 
large x. There seems to be a transition occuring somewhere around a: ^ 1 . The absolute value 
of the sum seems to be indeed of order ~ '/N and, as a consequence, plotting the distribution of 
{xQn mod 27r} on the torus shows essentially a uniform distribution. 

Acknowledgement. The result for the Ulam sequence was originally discovered using only the 
first 10.000 numbers for each set of initial conditions; the precision of the results presented here 
would not have been possible without the data sets compiled and generously provided by Daniel 
Strottman for all initial conditions that were discussed. Sinan Giintiirk observed the arithmetic 
regularity of the location of the peaks which gave rise to Section 2.3. Bruce Reznick was very 
helpful in explaining the early history of the Stern sequence. Jud McCranie informed the author of 
additional data and performed additional tests on them providing a better estimate for q;(i_ 2 ). The 
computations involving the C—function were carried out using Oldyzko’s list [T7] of the first 100.000 
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